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Abstract. We consider an integer lattice quasiperiodic Schrodinger op- 
erator. The underlying dynamics is either the skew-shift or the multi- 
frequency shift by a Diophantine frequency. We assume that the poten- 
tial function belongs to a Gevrey class on the multi-dimensional torus. 
Moreover, we assume that the potential function satisfies a generic 
transversality condition, which we show to imply a Lojasiewicz type in- 
equality for smooth functions of several variables. Under these assump- 
tions, for large coupling constant and for most frequencies, we prove 
that the operator satisfies Anderson localization. Moreover, we prove 
that the associated Lyapunov exponent is positive for all energies, and 
continuous as a function of energy, with a certain modulus of continuity. 



1. Definitions, notations, statement of main results 

In this paper we study the one-dimensional lattice quasiperiodic Schrodinger 
operator H(x) acting on Z 2 (Z) by: 

[H(x) V]„ := -Vwi - 4>n-i + A v(T n x) V>„ (1.1) 



where in equation (1.1): 

■ x = (xi,X2) £ Fis a parameter that introduces some randomness into 
the system; 

■ A is a real number called the disorder of the system; 

■ v(x) is a real valued function on T 2 = (M/Z) 2 , that is, a real valued 
1-periodic function in each variable; 

■ T is a specific ergodic transformation on T 2 , and T n is its nth iteration. 

Some of the questions of interest regarding this, or other related oper- 
ators, are the spectral types (pure point, absolutely continuous, singularly 
continuous), the topological structure of the spectrum, the rate of decay of 
the eigenfunctions, the positivity and regularity of the Lyapunov exponent, 
the regularity of the integrated density of states. 

Due to the ergodicity of the transformation T, the spectrum and the 
spectral types of the Hamiltonian system [H(x)] xG j2 defined by (fTTTj) are 
not random - that is, they are independent of x almost surely (see [9J). 

A stronger property than pure point spectrum is Anderson localization, 
which for the physical model indicates an insulating behavior, while a purely 
absolutely continuous spectrum indicates metallic (conductive) behavior. 
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Let us describe these concepts more formally. 

Definition 1.1. An operator satisfies Anderson localization (AL) if it has 
pure point spectrum with exponentially decaying eigenf unctions. 

Consider now the Schrodinger equation: 

H(x)ip = EtP (1.2) 

for ip = [ip n ]nez C R and Bel 

Due to Schnol-Simon's theorem (see [9]), to prove AL it is enough to show 
that every extended state is exponentially decaying. In other words, if ip is 
a formal solution to the Schrodinger equation (1.2 1 and if ip grows at most 
polynomially \ip n \ < |n| then ip decays exponentially: \ip n \ ^ e _c ' n ' 

The Schrodinger equation (1.2) is a second order finite differences equa- 
tion: 

-lpn+1 ~ 4>n-l + A v(T n X_) 1p n = Elp n 

which becomes 



tpn 



M N (x,E) 



00 



where 



M N (x,E) = M N (x,X,E) := ]J [ 



j=N 



\v(Ti x) 
1 



E 



is called the transfer (or fundamental) matrix of (1.1). 
Define further the function 

L N (x,E)=L N (x,X,E) :=— log\\M N (x,E)\\ 



N 



and its mean 



L N (E)=L N (X,E):= [ ^- T \og\\M N (x,E)\\dx 

Due to sub-additivity, the sequence Lpf(E) converges. 
Definition 1.2. The limit 



L(E) := lim L N (E) 



is called the Lyapunov exponent of (1.1) and it measures the average expo- 
nential growth of the transfer matrices. 

Ergodicity in fact implies that for a.e. x£T 2 , 



L{E) := lim ~log\\M N (x, E) 

N— >oo iV 



(1.3) 



Note that since the transfer matrices have determinant 1, the Lyapunov 
exponent is always nonnegative. An important question is whether it is in 
fact (uniformly) bounded away from 0. This would imply, due to Kotani's 
theorem, absence of absolutely continuous spectrum, and it would represent 
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a strong indication for pure point spectrum. This is also usually the as- 
sumption under which strong continuity properties hold and an analysis of 
the topological structure of the spectrum is more feasible. 

In this paper we prove Anderson localization and positivity and continuity 
of the Lyapunov exponent for large coupling constant, for certain ergodic 
transformations T on T 2 and under certain regularity and transversality 
conditions on the potential function v(x). While all of our results are stated 
and proven for the two-dimensional torus T 2 , their analogues on the higher 
dimensional torus hold as well. 

We describe the assumptions on the transformation and on the potential 
function. 

We start with some notations: for a multi-index m = (mi, m<i) G Z 2 , we 
write \m\ := |mi| + \m,2\ and if m = (mi,m2) G N 2 , then ml := mil • 777-2! 
Moreover, for a = (cti, 012) and m = (mi, 7772) G N 2 , we write a < m when 
u\ < mi and «2 < 7772. 

Throughout this paper, the transformation T: T 2 — > T 2 will represent: 

■ Either the skew-shift 

S w (xi,x 2 ) := (xi + x 2 ,x 2 +u) (1.4) 

where 00 G T is irrational. 
Its nth iteration is given by: 

Tlifl — 1 ) 

S" (x 1 ,x 2 ) = (xi + 77x 2 H 2 UJ ' x 2 + nuj) (1.5) 

■ Or the multi-frequency shift 

T^(xi,x 2 ) := (xi + ui,x 2 +w 2 ) (1.6) 

where to = (ui, L02) G T 2 and u>i,L02 are rationally independent. 
Its nth iteration is given by: 

T^ (x) = x + nuj_ = (xi + nui, X2 + 77w 2 ) (1.7) 

The irrationality / rational independence of the frequency ensures that 
the corresponding transformation is ergodic. However, we need to make a 
quantitative assumption on this rational independence, for reasons that will 
be described later. 

We say that the frequency oj G T satisfies a Diophantine condition DC K 
for some k > if 

dist (Iuj,Z) =: \\Iuj\\> k- —j — - — — ^ for all I G Z \ {0} (1.8) 

|£| [!og(l + |(|)J 

We say that the (multi) frequency lu G T 2 satisfies a Diophantine condition 
DC K for some k > and a fixed constant A > 2, if 

:= ||ZiWi + I2U2W > k ■ forall /GZ 2 \{(0,0)} (1.9) 
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Note that the set of frequencies which satisfy either (1.8) or (1.9) has 



measure 1 — O(k), hence almost every frequency satisfies such a Diophantine 
condition DC K for some k > 0. 

Now we describe the assumptions on the potential function v(x). 
■ We say that a C°° function v(x) on T 2 belongs to the Gevrey class 
G S (T 2 ) for some s > 1 if its partial derivatives have the following bounds: 



sup|d^ v(x)\ < Mifl— l(m!) 

x&T 2 



for all m £ N 



(1.10) 



for some constants M, K > 0. 

This condition is equivalent (see the exercises from Chapter IV in |15| ) to 
the following exponential- type decay of the Fourier coefficients of v. 

-p\l\ 1,a 



\v(l)\ < Me' 



for all I € 



1.11 



for some constants M, p > 0, where v(x) = v(l) < 



2ni l-x 



zez 2 



Note from ( |l.l0| ) or with s = 1 that the Gevrey class G^T 2 ) is the 

class of real analytic functions on T 2 . 

Note also that si < s 2 => G Sl (T 2 ) C G S2 (T 2 ), so the greater the order 
of the Gevrey class, the larger the class. 

The Gevrey-class of any order s > 1 is an intermediate Carleman class 
of functions between analytic functions and C°° functions. They are not, 
however, quasi- analytic (one can easily construct examples or use a general 
test for quasi-analyticity of Carleman classes, as in Chapter V.2 in |15j). 

We will then impose on our potential function v the following generic 
transversality condition (TC). 

■ We say that a function v (x) is transversal if v is not flat at any point: 
For any x G T 2 there is m e N 2 , \m\ / such that d— v(x) / (1.12) 



Non-constant analytic functions automatically satisfy (1.12). Therefore, 



a Schrodinger operator with potential given by a function which satisfies the 



Gevrey regularity condition (1.10) and the transversality condition (1.12) is 



a natural extension of the non constant analytic case considered in [S], [B]. 

We are ready to formulate the main result of this paper. 
Theorem 1.1. Consider the Schrodinger operator (1.1) on Z 2 (Z); 

[H(x) V>]n := -1pn+l ~ i>n-l + \v{T n X) V> n 



where the transformation T is either the skew- shift ( 1.4 ) or the multi-frequency 



shift (1.6). Assume that for some k > the underlying frequency satisfies 



the Diophantine condition DC K described in (1.8) or (1.9) respectively. 



Assume moreover that the potential function v(x) belongs to a Gevrey 
class G S (T 2 ) and that it is transversal as in (1.12). 
There is Aq = Ao(v, k) such that the following hold: 
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■ If |A| > Ao, the Lyapunov exponent is positive for all energies E £ R: 



■ If |A| > Ao, the Lyapunov exponent L(E) is a continuous functions of 
the energy E, with modulus of continuity on any compact interval £ at least: 



where C = C(£.,\,v,k,s) and c, rj are some positive absolute constants. 

■ Fix x G T 2 and A so that |A| > Ao- Then for a.e. frequency in DC K , 
H(x ) satisfies Anderson localization. 

2. Summary of related results, general strategy 

The results in this paper extend the ones in [6] and [5] (see also J. Bour- 
gain's monograph [4j) from non-constant real analytic potential functions, to 
the more general class of Gevrey potential functions satisfying a transversal- 
ity condition. They also mirror similar results obtained for the one-frequency 
shift model on the torus T (see |16j). 

It should be noted, however, that unlike the one or multi- frequency shift, 
the skew-shift, due to its weekly mixing properties, is expected to behave 
more like the random model (presumably regardless of the regularity of the 
potential). In other words, for the skew-shift, these results are expected 
to be independent of the size of the disorder A. Hence one expects that if 
A ^ 0, the Lyapunov exponent is positive and Anderson localization holds 
for all energies. Moreover, one expects no gaps in the spectrum (unlike 
in the one-frequency shift case, when the spectrum is a Cantor set) - see 
the comments at the end of Chapter 15 in [2|. Some results on these very 
challenging problems have been obtained in [2], [3], [18], [19] . 

Localization results for these types of operators defined by skew-shift dy- 
namics have applications to quantum chaos problems. More specifically, 
they imply existence of almost periodic solutions to the quantum kicked ro- 
tator equation. However, one has to establish (dynamical) localization for 
a more general, long range operator, one where the discrete Laplacian is re- 
placed by a Toeplitz operator with fast off-diagonal decay of its monodromy 
matrix entries. This was already established for analytic potential functions 
(see Chapter 15 and 16 in [1]), but we will not address this problem for 
Gevrey potential functions in this paper. 

Most of the results on localization for discrete quasiperiodic Schrodinger 
operators (with either shift or skew-shift dynamics) have been obtained un- 
der the assumption that the potential function is the cosine function, or a 
trigonometric polynomial or a real analytic and non-constant function (see 
J. Bourgain's monograph [1]). 

Assuming Gevrey regularity and a transversality condition, there are lo- 
calization results for the shift model that closely resemble the ones in the 



L(E) > ^ log |A| > 



(1.13) 



h(t) = Ce- c l logt l 



(1.14) 
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analytic case (see [UJ, [E]). It should be noted, however, that they are 
usually perturbative and that more subtle results regarding fine continuity 
properties of the integrated density of states (as in [T2], [T3]) or the topolog- 
ical structure of the spectrum (as in |14j ) are not available in this context. 

For potential functions that are more general than Gevrey, namely C a , 
the results available now (on localization and positivity of the Lyapunov 
exponent) require that a(n asymptotically small relative to the size A of 
the disorder but) positive set of energies be excluded or that the potential 
function be replaced by some generic variations of itself (see [JJ , [7J , [8] ) . 

To prove Theorem [lT] we will follow the same strategy used in |16| for the 
single frequency shift model: at each scale, substitute the potential function 
by an appropriate polynomial approximation (see Section [3]). This in turn 
will allow the use of subharmonic functions techniques (see Section [4| devel- 
oped in [5], [BJ, [3]. An additional challenge is describing the transversality 
condition (1.12) for multi- variable smooth functions in a quantitive way. 
We derive (see Section [5| a Lojasiewicz type inequality for such functions, 
of the kind previously available for non-constant trigonometric polynomials 
(see [17], [10]) or analytic functions (see [21], [T2]). 

The main technical result of this paper, from which all statements in 
Theorem |1.1| follow, is a large deviation theorem (LDT) for logarithmic 
averages of trans fer m atrices (see Section [6]) . 

According to (1.3), due to ergodicity, for a.e. x G T 2 : 

-log\\M N (x,E)\\^L(E) asA^oo 
The LDT provides a quantitative version of this convergence: 

mes [xeT 2 : \^log\\M N (x, E)\\ - L N (E)\ >e]<5{N,e) (2.1) 

where e = o(l) and 6(N, e) — > as A — > oo 

The size of the deviation e and the measure of the exceptional set 5(N, e) 
are very important. The sharpest such estimate (see Theorem 7.1 in |12j). 
available for the single-frequency shift model with analytic potential, holds 
for any e > and exponentially small measure S(N, e) ~ e~ cSN , thus morally 
matching the large deviation estimates for random variables that these de- 
terministic quantities mimic here. Having such sharp estimates leads to a 
sharper modulus of continuity of the Lyapunov exponent (see [12]). 

For the multi-frequency shift and the skew-shift models, even with ana- 
lytic potentials, the available estimates are not as sharp. In this paper, for 
Gevrey potential functions, we will get e ~ N~ T and S ~ e~ N<J for some 
absolute constants r, a 6 (0, 1). 



Following the approach in [3], [BJ, a large deviation estimate like (2.1) will 
allow us to obtain a lower (positive) bound and continuity of the Lyapunov 
exponent, once these properties are established at an initial scale Nq for 
Ln (E). It will also allow us (the reader will be refered to [3j, [6J for details) 
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to establish estimates on the Green's functions associated with the operator 



(1.1), more specifically the fact that double resonances for Green's functions 



occur with small probability, which leads to Anderson localization. 



Most of the paper will then be devoted to proving a LDT like (2.1): 



mes [x£T 2 : |1 log\\M N (x, E)\\ - L N (E)\ > AT T ] < e~ N ° (2.2) 
through an inductive process on the scale N. 



The base step of the inductive process for proving the LDT (2.2) is based 



exclusively on the transversality condition (1.12) on the potential, and on 
choosing a sufficiently large disorder A. The latter is what makes this ap- 
proach perturbative (and, in the case of the skew-shift model, wasteful, 
since it does not exploit the weakly- mixing properties of its dynamics). The 
former implies a Lojasiewicz type inequality, which we prove using a quan- 
titative form of the implicit function theorem. 

In the inductive step we use the regularity of the potential function v(x) 
and the arithmetic properties of the frequency. The regularity of v(x) allows 
us to approximate it efficiently by trigonometric polynomials vn(x) at each 
scale N, and to use these approximants in place of v(x) to get analytic 
substitutes Mn(x) for the transfer matrices Mn(x). Their corresponding 
logarithmic averages will be subharmonic in each variable which will allow 
us to employ the subharmonic functions techniques developed in [1] , [5] , [6] . 

The main technical difficulty with this approach, and what restricts it to 
Gevrey (instead of say, C a ) potential functions, is that the holomorphic ex- 
tensions of the transfer matrix substitutes M^{x) will have to be restricted 
to domains of size ~ N~ s for some 5 > 0. In other words, the estimates 
will not be uniform in N, and this decreasing width of the domain of holo- 
morphicity will have to be overpowered. This will not be possible for a C a 
potential function because its trigonometric polynomial approximation is 
less efficient, so the width of holomorphicity in this case will decrease too 
fast (exponentially fast). 

3. Description of the approximation process 
Let v S G S (T 2 ) be a Gevrey potential function. Then 

v(x)=Y J v(l)e 2ml - X - (3-1) 

lez 2 

where for some constants M, p > 0, its Fourier coefficients have the decay: 
\v(t}\ < Me-f>^ 1/s for all {eZ 2 (3.2) 
We will compare the logarithmic averages of the transfer matrix 

Ln(x,E) = Uog\\M N (x,E)\\dx=± log||n [ Xv ^f~ E - 1 

j=N 



(3.3) 
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with their means 



L N (E)= / L N (x,E)dx (3.4) 
Jt 2 

To be able to use subharmonic functions techniques, we will have to ap- 
proximate the potential function v(x) by trigonometric polynomials vat(x) 



and substitute v by vn into (3.3). At each scale N we will have a different 
approximant vn chosen in such a way that the "transfer matrix substitute" 
would be close to the original transfer matrix. The approximant vn will 
then have to differ from v by a very small error - (super)exponentially small 
in N. That, in turn, will make the degree deg vn =: N of this polynomial 



very large - based on the rate of decay (1.11) of the Fourier coefficients of 
v, N should be a power of N, dependent on the Gevrey class s. 

The trigonometric polynomial vn(x) has an extension vn(z), z = (z±, Z2), 
which is separately holomorphic on the whole complex plane in each variable. 
We have to restrict vn(z) in each variable to a narrow strip (or annulus, if 
we identify the torus T with M/Z) of width p^, where pn ~ (deg ww) 1 ~ 
N^ 1 ~ N~ e , for some power 9 > 0. This is needed in order to get a uniform 
in N bound on the extension vn(z). Moreover, in the case of the skew-shift, 
this is also needed because its dynamics expands in the imaginary direction, 
and in this case, the width of holomorphicity in the second variable will have 
to be smaller than in the first by a factor of ~ jr. 

The fact that the "substitutes" vn(x) have different, smaller and smaller 
widths of holomorphicity creates significant technical problems compared to 
the case when v (x) is a real analytic function. It also makes this approach 
fail when the rate of decay of the Fourier coefficients of the potential function 
v(x) is slower. 

Therefore, we have to find the optimal "error vs. degree" approximations 
of v(x) by trigonometric polynomials vn(x). Here are the formal calcula- 
tions. 

For every positive integer N, consider the truncation 

v N (x) := HOe 2 ™ 1 - (3.5) 

\1_\<N 

where iV = deg vn will be determined later. 

Since vjsr(xx, X2) is in each variable a 1-periodic, real analytic function on 
M., it can be extended to a separately in each variable 1-periodic holomorphic 
function on C: 

vn(z) := HQe 2 ™ 1 - (3-6) 

\1_\<N 

To ensure the uniform boundedness in N of ujv(£i> £2) we have to restrict 
vn{z\,Z2) to the annulus/strip [\^sz\ < pi 7 n] x [\^ z \ < Pi,n], where 

Pi,n ■— 

Indeed, if z\ = x\ + iy±, Z\ = x\ + iy\ and 1 2/1 1 , 1 2/2 ] < Pi,n, then: 



SCHRODINGER OPERATORS WITH MULTIVARIABLE GEVREY POTENTIALS 9 



\v N ( Zl ,z 2 )\ = I me 2mh -\ < E \HQ\e- 27T - V - 

\1\<N \1\<N 

<M^2 e~^ |1/s e |il|l?/l|+|/2|l?/21 <M^ e -p\i\ 1/s e \L\pi,N 

\1\<N \1\<N 

<M^ ^ P| - |1/S • e l^/ 2 l^ 1+1/s =M £ e - - 

|J|<JV |/|<7V 
3 -fKl 1/s 



-em 1 /* 



<#^e^l 1/s =:B<co 



where i? is a constant which depends on v (not on the scale N) and we have 
used : \ Vl \ , \y 2 \ < pi, N = f iV"- 1+1 / s < %\l_\- 1+1 / s for |/| < N, since s > 1. 

We also clearly have \v(x) — vm(x)\ < e~ p ^ 1/s for all x G T 2 . 

We will need, as mentioned above, super-exponentially small error in 
how vn(x) approximates v(x), otherwise the error would propagate and the 
transfer matrix substitutes will not be close to the original transfer matrices. 
Hence N should be chosen such that say e~ p ^ 1/a < e~ pN ' 1 ' . So if N := N 2s , 
then the width of the holomorphic (in each variable) extension vn{z) will 
be p hN = 2jV 2s (- 1+ i) = |AT-2(s-i) = . e N -S t where ,y := 2 (a - 1) > 0. 

We conclude: for every integer N > 1, we have a function vn(x) on T 2 
such that 

\v(x) - v N {x)\ < e~ pN2 (3.7) 

and vn(x) has a 1-periodic separately holomorphic extension vn(z) to the 
strip [\$>z\ < Pi,n] x [1^1 < Pi,n], where pi t N = %N~°, for which 

\v N (z)\<B (3.8) 

The positive constants p, B, 5 above depend only on v (not on the scale N). 
The constant 6 depends on the Gevrey class of v: 5 := 2(s — 1) so it is fixed 
but presumably very large. 

We now substitute these approximants vn(x) for v(x) in the definition of 
the transfer matrix Mn(x). 
Let 

\v(x)-E -1 
1 



A(x,E) :-- 



be the cocycle that defines the transfer matrix M^{x). 
Consider then 



A N (x,E) :-- 



\vn(x) — E —1 
1 
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which leads to the transfer matrix substitutes 

1 

M N (x,E) := H A N (Tix,E) 

j=N 

To show that the substitutes are close to the original matrices, we use 
Trotter's formula. This is a wasteful approach, and clearly in part responsi- 
ble for our inability to apply these methods beyond Gevrey functions. There 
are other, much more subtle reasons for why this approach is limited to this 
class of functions. 



M N (x) - M N (x) 



N 



AT N x) . . . A(T j+1 x) [A(T j x) - A N {T j x)} A^T^x) . . . A N (Tx) 

3=1 

" Av(T%) - \v N {Tix) 




A(T j x) - A N (T j x) 



so 



\\A(T*x)-A N (T*x)\\ < |A| S up\v(y) -v N (y)\ < |A| e"^ 2 

j/GT 2 

Since sup^g-jp \v(x)\ < B, the spectrum of the operator H(x) is contained 
in the interval [— 2 — |A|B,2 + |A|5]. Hence it is enough to consider only 
the energies E such that \E\ < 2 + |A| B. We then have: 



PCFx)|| = 
and 



Av(TJ'x) — E -1 
1 



< |A| B + \E\+2 < 2\X\B + 4 < e 



5(A) 



\\A N (T j x)\\ < 
Therefore, 



\v N {Tix) - E -1 
1 



< |A| B + \E\ +2 < e 5(A) 



\\A(Vx)l\\A N {Vx)\\<e s ^ (3.9) 

where S(X) ~ log | A | is a scaling factor that depends only on the (assumed 
large) disorder A and on v (the constants inherent in ~ depend on the 
number B = B(v) which also determines the range of spectral values E). 
We then have: 

N 

\\M N (x,E)-M N (x,E)\\ <J2e S(X) 

3=1 

< e NS(\)~ P N 2 < e -f N 2 

provided N > 5(A). 

Hence uniformly in x £ T 2 we get: 



e 5(A)| A | e -^ e 5(A) ___ e S(A) < 



\\M N (x,E)-M N (x,E)\\ < e-2 



ZN 2 



(3.10) 
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provided we choose 

N>S(X) (3.11) 
which means roughly that A has to be at most exponential in the scale N. 

We are now going to turn our attention to the logarithmic averages of the 
transfer matrices. 

Since det Mj^(x) = 1 and det-Mjv(^) = 1, we have that ||Mjv(x)|| > 1 and 
\\M N (x)\\ > 1. Thus, for all N > S(X) and for every x G T 2 , 

\±log\\M N (x)\\ - ±Iog\\M N (x)\\\ < ±\\M N (x) - M N (x)\\ < e~i N * 
Recall the following notation: 

L N (x,E) = ^log\\M N {x,E)\\dx (3.12) 



and define its substitute: 



u N (x,E):=~log\\M N (x)\\ (3.13) 



Therefore, uniformly in x € T and in the energy E: 



\Ln(x, E) — un(x, E)\ < e 



.en 2 

2 JV 



and by averaging in x: 



\L N (E)-(u N (E))\<e-% N2 



where L]y(E) := jj 2 Ln(x, E) dx and for any function u(x), (u) := f T2 u(x)dx. 

The advantage of the substitutes un(x) is that they extend to pluri- 
subharmonic functions in a neighborhood of the torus T 2 , as explained be- 
low. 

For the skew-shift transformation T = S w we consider the strip A n := 

En 

< pi, N ] x [\$z\ < P 2,n] where p 1)N = ^N~ s and p 2>N := %f = f N' 6 ' 1 
We have to reduce the size of the strip in the second variable to account 
for the fact that the skew-shift expands in the imaginary direction. Our 
approximation method required a reduction in the size of the holomorphic- 
ity strip at each scale, and this additional reduction will be comparatively 
harmless. 

If we extend the map S w from T 2 = (M/Z) 2 to C 2 , by 
S w (zx,z 2 ) = (zi + z 2 , z 2 + uj) 



we get as in (1.5) that 

TiOn — 1) 

S2(si) z 2 ) = {z\ + nz 2 H u, z 2 + nu) 

Then if (zi, z 2 ) £ A D and if we perform n < N iterations, we have: 

En 

+ nz 2 + n(n ~^ uj)\ = |9f(zi + nz 2 )\ = \ yi + ny 2 \ < ^N' 5 (3.14) 
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The matrix function 

A N {x\ 



Xvn(x) — E 



1 







extends to a 1-periodic, separately in each variable holomorphic matrix val- 
ued function: 

Xv N (z) -E -1 
1 



A N {z) r- 



Using (3.8) and the definition of the scaling factor S(X), we have that on 
the strip A p the matrix valued function An{z) is uniformly in N bounded 



by e s ( x \ Combining this with (3.14), the transfer matrix substitutes extend 



on the same strip to separately holomorphic matrix valued functions 

l 

M N (z,E) := 11 A N (Slz,E) 
j=N 

such that, for all z E A and for all energies E we have 



Therefore, 



\\M N (z,E)\\ <e NS ^ 



u N (z) := -log\\M N (z)\\ 



is a pluri-subharmonic function on the strip A n , and for any z in this strip, 

\u N (z)\ < S(X) 

The same argument applies to the multifrequency shift T = T w . The 
extension of this dynamics to the complex plane 

T^(zi, z 2 ) = (zi +ujx,z 2 + uj 2 ) 

does not expand in the imaginary direction, so there is no need to decrease 
the width of the strip in the second variable as in the case of the skew-shift. 
However, for convenience of notations, we will choose the same strip A n 

£-JV 

for both transformations. 

We can now summarize all of the above into the following. 

Lemma 3.1. For fixed parameters A, E, for a fixed transformation T = S u 
or T = T^ and for 5 = 2(s — 1) ; at every scale N we have a 1-periodic 
function 

u N (x) := ^\og\\M N (x)\\ 
which extends to a pluri-subharmonic function uj^(z) on the strip A p = 
< pi,jv] x < P2,n], where ~ N~ s , p 2 ,N ~ N^ 6 ^ 1 so that 

\unU)\<S(X) for all z£A (3.15) 



Note that the bound (3.15) is uniform in N. 
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Moreover, if N > S(X), then the logarithmic averages of the transfer 
matrices M^(x) are well approximated by their substitutes un(x): 

lllogllM^^H-n^^l^e-^ 2 (3.16) 

\L N -(u N }\ <e~ N2 (3.17) 

All the inherent constants in the above (and future) estimates are either 
universal or depend only on v (and not on the scale N) so they can be 
ignored. The estimates above are independent of the variable x, the param- 
eters A, E and the transformation T. 

This s a crucial technical result in our paper, which will allow us to use 
subharmonic functions techniques as in [1], [6] for the functions ujy, and then 
transfer the relevant estimates to the rougher functions they substitute. 

The logarithmic averages of the transfer matrix have an almost invariance 
(under the dynamics) property: 

Lemma 3.2. For all x £ T 2 , for all parameters \,E and for all transfor- 
mations T we have : 

\^\og\\M N (x)\\ - ilog||M*(Tx)[| | < ^ (3.18) 



Proof. 



1 \og\\M N {x)\\ - ±-\og\\M N {Tx)\\ I = ji-log- l|A/v( ^ l! 



N to " w " N b " v 1 JV b ||Mjv(T x)|| 

_ I 1 lo ^(T^) • • • • • A(T 2 x) ■ A(Tx)\\ 



I N -e, p( T Ar+i^ . A(T N x) • . . . • A{T 2 x)\\ 1 
< llogtlK^T^x))- 1 !! • \\A(Tx)\\] < M 
where the last bound is due to (|3.9|). The inequality (3.18) then follows. □ 



4. Averages of shifts of pluri-subharmonic functions 



One of the main ingredients in the proof of the LDT (2.2) is an estimate 
on averages of shifts of pluri-subharmonic functions. These averages are 
shown to converge in a quantitative way to the mean of the function. The 
result holds for both the skew-shift and the multi- frequency shift. 

For the skew-shift, the result was proven in [6] (see Lemma 2.6 there). 
We will reproduce here the scaled version of that result, the one that takes 
into account the size of the domain of subharmonicity and the sup norm of 
the function. The reader can verify, by following the details of the proof in 
[B], that this is indeed the correct scaled version. 

For the multi- frequency shift, the result is essentially contained within 
the proof of Theorem 5.5 in [1], but for completeness, we will include here 
the details of its proof. 
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Proposition 4.1. Letu(x) be a real valued function onT 2 , that extends to a 
pluri-subharmonic function u(z) on a strip A p = [|3^i| < pi] x [|3&2| < P2\- 
Let p = minjpi, P2}. Let T be either the skew-shift or the multi-frequency 
shift on T 2 , where the underlying frequency satisfies the DC K described in 



(1.8) or (1.9) respectively. Assume that 



sup \u(z)\ < S 

z£A„ 



Then for some explicit constants oq, to > 0, and for n > n(n) we have: 

S 
P 



mes 



1 n— 1 

x) — (it) > 



(4.1) 



j=0 



Here is how this estimate can be understood. Given the ergodicity of 
the transformation T for irrational (or rationally independent) frequencies, 
on the long run, the orbits T J x of most points x will tend to be fairly well 
distributed throughout the torus T 2 (see the picture below). The average 
~ X^=o u(T^x) will then resemble a Riemann sum of the function u(x) and 
as such, it will approach the integral (u). 



1 . 1 .* 

"J ."" ! 4f "» ~ 


* . '. 


-V-*- 






- % J- - - - 


' 1 * 1 • i 


i A _;_ 










••.%•" • 




■ 










,«J « 




V 


■ ' * * 




" * r > 1 












" j 






. 






_ . _ j_ • .* 
* 1 1 


• *1 "* 








■ •;■ ■ 

a- * 


I r 








* * ■ 




■ r 


r 


1 


1 





(a) Iterations of the skew-shift 



(b) Iterations of the 
multifrequency shift 



Moreover, a quantitative description of the irrationality (or rational in- 
dependence) of the frequency in the form of a Diophantine condition like 



(1.8), (1.9), should lead to a quantitative description of the convergence of 
the average sum to the integral (it). 

To prove this quantitative convergence result, we consider the Fourier 
expansion of the function u(x) and apply it to the average sums. This 
leads to a convolution of u(x) with a Fejer-type kernel. It is crucial to 
have estimates on the Fourier coefficients of the function it, and they are 
obtained via Riesz' representation theorem for subharmonic functions (see 



SCHRODINGER OPERATORS WITH MULTIVARIABLE GEVREY POTENTIALS 15 

Corollary 4.1. in [1]). Since u(zi, z<i) is pluri-subharmonic, the scaled version 
of Corollary 4.1. in [1] implies: 

SI SI 

sup |«(Zi,x 2 )| < jz-r and sup \u(xi, h)\ < 777 (4.2) 

i 2 eT Pi I'll xiGT P2 |«2| 



The estimates (4.2) imply (small) upper bounds on the L 2 - norm of the 
part of the Fourier expansion for which at least one of the indices h and 
I2 is large. The difficult part is when both indices l\ and I2 are small, in 
which case we use the Diophantine condition on the frequency to estimate 
the resulting exponential sums. 

In the case of the skew shift dynamics (1.5), the resulting exponential 
sums are quadratic, and they are estimated using WeyPs method (see |B] for 
the details of the proof). We will now present the details of the proof for 
the multi- frequency shift case Tx = T^x := x + u. 

Proof. Expand u(x) into a Fourier series 

u(x) = (u)+ E HD-e 2 ™ 1 - 

lex 2 

Then the averages of shifts have the form 
n— 1 ., 71—1 



n n 
3=0 j=0 



- u(T^x) = ^ E + ^ 

3=0 

- IL L 

(„>+ e a© •^■(^E 

!GZ 2 3- 
1^(0,0) 

(u)+ e m-^ il --K n {i 



n-1 

2wijl-u 



where we denoted by K n (t) the Fejer kernel 

1 71—1 



n ^— ' n 1 — e 2mt 

i=o 



which clearly has the bound 

|# n (i)| <min{l,-^} (4.3) 



We then have 

n-1 



^E^+^)-m ' = E l«(oM*na-^l : 



Z, 2 (T 2 ) 

E l fi U)l 2 - |^na-^)| 2 + e KDM^a-^f 



in 

i=o igz : 

1/(0,0) 



1<\1\<K \1\>K 



16 



SILVIUS KLEIN 



We will estimate the second sum above using the bounds (4.2) on the Fourier 
coefficients of u(x) and the first sum using the DC (1.9) on the frequency u_. 
The splitting point K will be chosen to optimize the sum of these estimates. 
Clearly (14. 21) implies: 



and 



£ \u(h,h)\ 
£ \u(h,h)\ 



u(h,x 2 ] 



u(xi,l 2 ) 



< 



< 



S_ _1_ 
Pi \h 



S 1 



< 



< 



S\2 1 



P J \h 

S\2 1 

p< 



\h 



Then we have: 



£ |«a)| 2 -|^n(i-u;)| 2 < £ I^)l 2 

\1\>K \1\>K 

sy i_ 



< £ l^)l 2 + E l fi 0DI 

«: /: |/2|>^/2 



< 



Estimate (4.2) clearly impies: 



5 1_ 

rs-< I ; I 

P HI 



Then using the DC (1.9) on a; and (4.3), we obtain: 

'S\2 



£ |n(I)| 2 -|A'„(I-a;)| 2 < 
1<\1_\<K 



P 



£ A- 



1 



1<I«I<^ 
2A 



-IJ ^ l/l 2 n 2 K 2~ 



n 2 ||/ • u\\ 2 

sy k 2A 



1<\1\<K 



o 9 \ / 9 9 



We conclude: 

n-1 



| - £ "(^ + M) - (l 



3=0 



Z, 2 (T 2 ) 



< 



1 



5 



/3 VifV2 



+ 



71K 



< -n" a 
P 



for some positive constant a that depends on A and for n large enough 
depending on A and k. 

Using Chebyshev's inequality, the above estimate implies: 

1 n ~ 1 <? 



mcs 



j=0 



This is not exactly what we wanted, since the size of the "bad" set above 
decays only polynomially fast in n, instead of exponentially fast. 

To boost this estimate, we will use Lemma 4.12 in J. Bourgain's mono- 
graph [4J. This result shows that a weaker a-priori estimate on a subhar- 
monic function implies an upper bound on its BMO norm, which in turn 
leads, via John-Nirenberg inequality, to a stronger estimate on the function. 
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We reproduce here a "rescaled" version of the estimate in one that takes 
into account the width p of subharmonicity. The reader may verify that this 
is indeed the correct rescaled version of the statement. 

Lemma 4.1. Assume that u = u(x): T 2 — > K has a pluri-subharmonic 
extension u(z) onA p = < pi] x [jQ^I < P2] such that sup < B. 

Let p = mm{p 1 ,p 2 }. If 

mes [1 £ T 2 : \u(x) - (u)\ > e ] < ei (4.5) 
then for an absolute constant c > 0, 



mes [1 £ T 2 : \u(x) - (u)\ > e 1/4 ] < e c ^° 1/4+ V p ^m) ( 4 . 6 ) 
We will apply this result to the average 



j=0 

Clearly u^(x) is pluri-subharmonic on the same strip A p as u(x), its upper 

bound on this strip is B = p and its mean is (u")= ^ (u) 

o 

Then (4.4) implies 

mes [i£T 2 : (^(^-{n 11 }) > e ] < e x (4.7) 

where eo := n _a / 3 and e\ := n~ 4a//3 so ei <C eo- 

Applying Lemma 4.1 and performing the obvious calculations, from in- 
equality (4.6) we get 

mes [i£T 2 : \u\x)-{J)\ > n~ a / 12 } < e -^ a/12 
which then implies (4.1) for the multi- frequency shift T w . □ 



5. LOJASIEWICZ INEQUALITY FOR MULTIVARIABLE SMOOTH FUNCTIONS 



To prove the large deviation estimate (2.2) for a large enough initial 



scale iVo, we will need a quantitative description of the transversality con- 



dition (1.12). More precisely, we will show that if a smooth function v(x) is 
not flat at any point as defined in (1.12), then the set [x: v(x) ~ E] of points 
where v (x) is almost constant has small measure (and bounded complexity). 

Such an estimate is called a Lojasiewicz type inequality and it is already 
available for non-constant analytic functions. For such functions it can be 
derived using complex analysis methods from [20], namely lower bounds for 
the modulus of a holomorphic function on a disk (see Lemma 11.4 in |12|). 

For non-analytic functions, the proof is more difficult. Using Sard-type 
arguments, we have obtained a similar result for one-variable functions (see 
Lemma 5.3 in [E]). For multivariable smooth functions, the argument is 
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more technical and it involves a quantitative form of the implicit function 
theorem, also used in [7] and [II] . 

We begin with a simple compactness argument that shows that in the 
TC (1.12) we can work with finitely many partial derivatives. 

Lemma 5.1. Assume v(x) is a smooth, 1-periodic function on M 2 . Then 
v(x) satisfies the transversality condition (1.12) if and only if 

3m eN 2 |m|^0 3c>0: VxGT 2 max\d^v(x)\ > c (5.1) 



The constants m,c in (5.1) depend only on v. 



(|1.12|). We prove the converse. The TC ( 1.12 ) implies 



Proof. Clearly fl5.1) 

VieT 2 ]ra,6 N 2 \m x \ / such that \d-^v(x)\ > c x > 

Then there are radii r x > so that if y is in the disk D(x,r x ) we have 
\d— -v {y)\ > c x > 0. The family {D(x, r x ): x £ T 2 } covers T 2 . Consider a 
finite subcover {D(x_i, r 5 ), . . . , D(x k , r Xk )}. Let m 6 N 2 such that m>m 



for all 1 < j < k and c := min - Then (15. II) follows. 

i<j<k ~ 3 



■3 
□ 



The following is a more precise form of the implicit function theorem 
(which was also used in |14j). 

Lemma 5.2. Let f(x) be a C 1 function on a rectangle 31 = I x J C [0, l] 2 , 
let J = [c,d] and A := m&x\d Xl f(x)\. Assume that 



min I dx 2 f(x) 

X<=Ji 



--■■e >0 



(5.2) 



If f (01,02) = for some point (01,02) £ 31, then there is an interval Iq 
(ai — k, 02 + k) C / and a C 1 function 4>o(xi) on Io such that: 

(i) f{xi, 4>o{xi)) = for all x\ £ I 

(ii) \d xl <j>o(xi)\ < A eg 1 

(Hi) x\ £ Io and f(x\,X2) = x<i = 4>o{x\) 

Moreover, the size k of the domain of (po can be taken as large as k 
cqA^ 1 ■ min{a2 — c,d — 02}- 



Proof. Prom (5.2), since d X2 f(x) is either positive on 31 or negative on 31 (in 
which case replace / by — /), we may clearly assume that in fact: 

mmd X2 f(x) =: e > (5.3) 

Moreover, note that for any fixed x\ 6 /, since d X2 f(x\, X2) / 0, the 
equation f(x\, X2) = has a unique solution X2- 
Let x\ £ Jo- Then 

|/(xi,a 2 )| = |/(xi,a 2 ) - f(ai,a 2 )\ = \d Xl f(£, a 2 )| • \x\ - a\\ < Ak 

We have two possibilities. 



SCHRODINGER OPERATORS WITH MULTIVARIABLE GEVREY POTENTIALS 19 

■ < f(x\, a 2 ) < Ak. Then, if a 2 — t G J we have: 

/(xi,a 2 -t) - f(x 1 ,a 2 ) = d X2 f(xi,g) ■ (-t) 

f(xi,a 2 - t) = f{xi,a 2 ) - t ■ d X2 f(xi,C) < Ak - te = 
provided t = Ae$ 1 k 

For this choice of t, a 2 — t is indeed in J, because of the size kq of the 
interval Iq: t = Ae$ 1 k < AeQ 1 eoA~ 1 (a 2 — c) = a 2 — c, so a 2 — t > c. 
Therefore, 

f(xi,a 2 - t) < < f(xi,a 2 ) 
so there is a unique x 2 =: (/>o(xi) G [a 2 — t, a 2 ] such that f(xi, (f>o(x%)) = 0. 

■ — Ak < f(x±,a 2 ) < 0. Then, if a 2 + t G J we have: 

/(xi,a 2 + i) - /(xi,a 2 ) = d X2 f(xi,C) ■ t 
f(xi,a 2 +t) = f(x 1 ,a 2 ) + t ■ d X2 f(x 1 ,(,) > -Ak + te = 
provided i = x k 

As before, for this choice of t, a 2 + t is in J, because of the size k of the 
interval Iq: t = Ae$ 1 k < Ae$ l e$A~ l (d — a 2 ) = d — a 2 , so a 2 + t < d. 

Therefore, 

f(xi,a 2 ) < < f(x 1 ,a 2 + t) 

so there is a unique £2 =: </>o(£i) G [02, «2 + 1] such that /(xi, 4>q(xi)) = 0. 

We proved (i) and (iii). The fact that 4>o(xi) is C 1 follows from the stan- 
dard implicit function theorem, while the estimate (ii) follows immediately 
from (i) using the chain's rule. 

□ 

The following is a quantitative and global version of the previous lemma 
(see also Lemma 8.3 in [H]). It says that under the same conditions as 
above, the points (21,22) G 31 for which |/(xi,X2)| < e are either in a 
narrow strip at the top or at the bottom of the rectangle 31, or near the 
graphs of some functions (j)j(xi), in other words x 2 ~ (j)j(xi). 

Lemma 5.3. Let f(x) be a C 1 function on a rectangle 31 = I x J C [0, l] 2 , 
where \I\ ~ kq. Let J = [c,d] and A := max \d xi f(x)\ . Assume that: 

mm\d X2 f(x)\ =: e > (5.4) 

Let e\ > be small enough, i.e. e± < and k\ ~ e\A~ x . 

Then there are about kqk^ 1 sub-intervals Ij of length k\ covering I, and 
on each interval Ij there is a C 1 function <fij(x\) such that: 

(i) f(x 1 ,(f) j (x 1 )) = for all xi G Ij 

(ii) \d Xl cj)j(xi)\ < AfQ 1 

(iii) [{x\,x 2 ) G 31: \ f(x\,x 2 )\ < ei] C 3$ U 3i b U (Uj Sj) 
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Proof. Divide the interval /, whose length is ~ k,q into ~ kqk 1 1 sub-intervals 
Ij of length Ki each. 

If |/(xi,X2)| > ei for all (21,22) G Ij x [c + 2ci€q , d — 2eieQ 1 ], then we 
are done with the interval 

Otherwise, assume /(ai,a 2 ) < e\ for some a\ G and 02 G [c + 
2e 1 e \ci-2eie 1 ]. 

We may assume < /(ai, 02) < ei, the other case being treated similarly. 
Then if a<i — t G J we have: 

f(at,a 2 -t)- f(ai,a 2 ) = d X2 f(ai,£) ■ (-t) for some £ G (a 2 - t,a 2 ) 

f(a 1 ,a 2 - t) = f (0,1,02) - t ■ d X2 f(ai,g) < e\ - e t = 
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provided t = e\e . Since 02 > c + 2eie , for this t we have 02 — t G J. 

We then have /(ai,a 2 — i) < < 7(01,02), so /(ai,a|) = for some 
a* 2 G [a 2 - i, a 2 ]. 



We can use Lemma 5.2 around the point (ai, a^). The interval we get has 
length at least eo^4 _1 • min{a2 — c,d — 02} > €qA _1 • 2eieg 1 = 2e\A~ l > 2k±, 
so it contains Ij , whose length is ~ K\ . We have a C 1 function <pj on Ij such 
that l^a^^l < Ae$ 1 and 

Xi € Ij and f{x\,x%) = X2 = (j>j{x\) 

Now let (xi,x 2 ) G % such that /(xi,X2) < ei- Then either (xi,x 2 ) G 
ft* U ft fe or (xi, x 2 ) G Ij x [c + 2ei£Q , d — 2eie$ l ] for some j, in which case: 

ei > |/(xi,x 2 )| = |/(xi,x 2 ) - f(x 1 ,<f) j (xi))\ = 
= \d X2 f(xi,€)\ ■ \x 2 - 4>j{xi)\ > e • |x 2 - 4>j(x{)\ 
from which we conclude that \x2 — <f>j(xi)\ < €\€q 1 . 

□ 

We have shown that the points x = (xi, X2) G ft for which |/(x)| < t\ are 
within ~ e\ from the graphs of some functions <j>j(xi) that have bounded 
slopes and are defined on small intervals Ij. This shows that the 'bad' 
set [x G ft: |/(x)| < ei] can be covered by small rectangles instead of t\- 
neighborhoods of curves, and we have control on the size of these rectangles 
and on their number. In turn, the 'good' set [x G ft: |/(x)| > ei] can be 
covered by a comparable number of rectangles, which can be further chopped 
down into squares, to preserve the symmetry between the two variables. This 
is the content of the following lemma. 

Lemma 5.4. Given a C 2 function fix) on a square ft = Iq x Jq C [0, l] 2 , 
where \Iq\, \Jq\ ~ kq. Denote A := max max 19— f(x)\. Assume that: 

\a\<2 x& 

min \d X2 f(x)\ =: e > or min \d xi f(x)\ =: e > (5.5) 

Let ei > be small enough, i.e. e\ < and K\ ~ eiA^ 1 . 
Then there is a set r B l C ft > with 

mes [BjJ < k ei e 1 (5.6) 

such that ft \ S 1 is a union of about (ko^ 1 ) 2 squares, where each such 
square has the form {R^ = I\ X J\, with \l\\, Mi ~ K\. 
For each of these squares we have: 

min |/(x)| > ei (5-7) 



Proof. We will use Lemma 5.3 Iq is covered by about kok^ 1 subintervals Ij 
of length K\. Consider one such subinterval. There is a C 1 function (j)j{xi) 
on Ij such that \d Xl 4>j(xi)\ < Acq 1 and 

[(x 1; x 2 ) G Ij x J : |/(xi,x 2 )| < ei] C ft* U ftj U 
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where if Jo = [co,c£o] then 



Ij x [d - 2eie , d ] 
Ij x [cq, c + 2eie " 1 ] 
[(xi, x 2 ) : xi G I,-, |x 2 - 0j(a7i)| < e^ 1 ] 



Then 

S 7 - C Ij x [min 4>j(xi) - ei£g \ max 4>j(x\) + eie^ x ] =: L- x if" 1 =: ft" 1 

For any x\, x[ G Ij we have 

|</)j(xi) - (j)j{x' x )\ < AeQ 1 ■ |xi -ari| < Ae^Ki ~ eie^ 1 
which shows that 

l K j I ~ £ i e o 

We have shown that [(xi,X2) G Ij x J : |/(xi,X2)| < ei] is covered by 
three rectangles: ftj, ftj, ft™, each of the form Ij x Kj where |/j| ~ ki, 

Summing over j < kqk^ 1 , we get that the set [x G ft : |/(x)| < ei] is 
contained in the union S 1 of about kqk^ 1 rectangles of size m x eieQ 1 . Then 

mes < k /% 1 • ki • eie 1 = ko€i€q 1 



which proves (5.6) 



The complement of this set, ft \ S l3 consists of about the same number 
ko/% of rectangles - this was the reason for switching from ei-neighborhoods 
of curves to rectangles. Each of these rectangles has the form Ij x Lj, where 
\lj\ ~ K\ and \l>j \ ~ kq — 0(€\€q 1 ) ~ kq 3> «i. Divide each of these vertical 
rectangles into about kq^i 1 squares of size K\ x k\ each. 

We conclude that ft \ S x is covered by about (kqk^ 1 ) 2 squares of the 
form %y = I\ X J\ , where the size of each square is 1 1\ | , | J\ | ~ aci . 

□ 

We now have all the ingredients for proving the following Lojasiewicz type 
inequality. 

Theorem 5.1. Assume that v(x) is a smooth function on [0, l] 2 satisfying 
the transversality condition Then for every e > 

sup mes [x G [0, l] 2 : \v(x) — E\ < e] < C ■ e b (5.8) 



where C, b > depend only on v. 
Proof. Using Lemma |5.1[ 

3m = (mi, 7712) G N 2 \m\ / 3c > : Vx G T 2 max|<9-i>(x)| > c 



a. <m 
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Let 



A := max max \d— v(x) 

Q<(mi+l,m 2 + l) xe[0,l] 2 



We may of course assume that \E\ < 2 A, otherwise there is nothing to 
prove. 

All the constants in the estimates that follow will depend only on \m\ , c, A 
(so in particular only on v). 

Partition [0, l] 2 into about (^r) 2 squares of the form 3? = / x J of size 

I 7 M j I~m- 

Let Jl be such a square. Then either > e for all x G 3?, in which 

case we are done with this square, or for some a = (ai, 02) € 9? we have 
\v(a)\ < e. But then for one of the partial derivatives a < m, \a\ 7^ 0, we 
have \d— v(a)\ > c. 

Assume for simplicity that |<9— v(a)\ > c, which is the worst case scenario. 

If x E then ||x — a||oo := max{|xi — a\\ , \x2 — «2|} < 2^- 

Then 

\d^v(x) - d^v(a)\ < max \V&^v(y)\ ■ \\x - alU <A- — = - 

1 1 y£$. 1 — 1 2A 2 

It follows that 

c 



mm 



d^v(x)\ 



> 



ml =: m times. 



We will use Lemma 5.4 
■ Step 1. Let 

f x {x) := 5 (mi ' m2 " 1) v(x) 

Then 

min 9 X2 /i(x) = min \d— v (x)\ > 



We apply Lemma 5.4 to the function f± with the following data: 
2<o = & «o = 2^-, eo ~ c, ei < — ^— , «i ~ ei^4 

where ei will be chosen later. 

We get a set 2^ := S x , mes [S^] < Ko^o 1 < k^t! " e i e o 2 sucn tliat 
is a union of about (kokJ -1 ) 2 squares of the form ^ = li x Ji, of 
size \li , Ji ~ «i. For each of these squares we have: 



which means: 



mm 



min > ei 



■ Step 2. Pick any of the squares = /1 x Ji from the previous step 
and consider say 

f 2 (x) := d^-^-^vix) 
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Then 



mm\d x J 2 (x)\ = min|9 ( " ll ' m2 - 1) u(x)| > ej 



Apply Lemma 5.4 to the function f 2 with the following data: 



Si, «i, ei from Step 1, e 2 < Sf 1 , k 2 ~ £2-4 1 

where e 2 will be chosen later. 

We get a set 33 2 , mes [B 2 ] < Kie 2 €^ sucn that 3^ \ 23 2 is a union of 
about (k\K2 1 ) 2 squares of the form 3l 2 = I2 x J2, of size |/ 2 |, \ J 2 \ ~ «2- For 
each of these squares we have: 

min|/ 2 (x)| > e 2 

which means: 

mini g^i-l.ma-l) v(x)\ > e 2 

If we do this for each of the ~ (kqk^ 1 ) 2 squares resulting from Step 1, 
and if we put together all the 'bad' sets 33 2 corresponding to each of these 
squares, we conclude the following. 

There is a set S 2 C 31 such that: 

mes [S 2 ] < K\e 2 e{ x ■ (kqk^ 1 ) 2 = kqc^ 1 ^ 1 ~ KqA • e 2 e{ 2 

Hence the total measure of the 'bad' set in Step 2 is: 

mes [B 2 ] < k 2 A ■ e 2 e\ 2 

Moreover, 31 \ (B^ U 2 2 ) is covered by squares of the form 3i 2 = I 2 x J 2 , 
of size J2 , J2 ~ K 2 - 

The total number of such squares is about 

(«i«2 x ) 2 • (Ko^r 1 ) 2 = ( k ok 2 x ) 2 

On each of these squares we have: 

mini ^-l.ma-l) u(z)| > e 2 

It is clear how this procedure continues. Perform it for m — 1 steps. We 
will get sets 2^, . . . ,3^ m _ x such that %\ (# x U . . .US^j) consists of about 
(KoKm-i) 2 squares of the form 3i m _ 1 = I m _iX J m _i, of size |/ m _i|, |j m _i| ~ 
«; m _i. On each of these squares we have 

min I d X2 v(x) \ > e m _i or min I d x ,v(x)\ > e m _i 



■ Step m. Assume the former inequality above and apply Lemma 5.4 
one more time. Let 

f m (x) := f (x) - £ 



SCHRODINGER, OPERATORS WITH MULTIVARIABLE GEVREY POTENTIALS 25 



for some fixed energy E with \E\ < 2 A (the estimates will not depend on 
E). Then for each of the squares % m _\ from the previous step we have: 

min \d X2 f m (x)\= min \d X2 v(x)\ > e m _i 



Apply Lemma 5.4 to the function f m with the following data: 

^Rm-i) K m-i, e m -l from the previous step, e m < ?™=^=i ) Km ^ e m A~ l 

where e m will be chosen later. 

We get a set 23 m , mes [!B m ] < Km.ieme^j such that ^ m _i \ 23 m is a 
union of about {K m -\K~}) 2 squares of the form % m = I m x J m , of size 
\lm L I JW I ~ K m- For each of these squares we have: 



which means: 



min |/m(x)| > e r 



min \v(x) — E \ > e r 



If we do this for each of the ~ (^o^m-i) 2 squares resulting from the 
previous step, and if we put together all the corresponding 'bad' sets, we 
conclude. 

There is a set r B^ n C 3? such that: 



2 

«i c TO -l 



Hence the total measure of the 'bad' set in Step m is: 

mes [S m ] < k\A ■ e m e m 2 _ x 
Moreover, 3i \ (33^ UB^.-U r B a m ) is covered by squares of the form % 

I'm 

x J m , of size 

The total number of such squares is about 
On each of these squares we have: 



min \v(x) — E > e 



Therefore, the total measure of the bad set from all steps is: 

mes [ti U t 2 ■ ■ • U Bjj £ ■ [e^o 2 + + ■ ■ ■ e - e ™-i] (5-9) 
We choose 

i lorn — j 

6j := e ' for 1 < j < m 

If e < e*(c, m), then eo ~ c > e 1 / 3 " 1 , e~ 2 ~ c~ 2 < e~ 2 , so there is no harm 
in also putting (for simplicity) eo = e 1 / 3 '". 

It is a simple calculation to see that for any e < e*(m, A), we have tj+i < 
for all j = ... m — 1, which allows our inductive process to work. 
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Note that e 3 = ej+i so Cj+i€j 2 = ej. This implies: 

o 9 9 1 /'V 1 

eie + £ 2£i + . . •e m e m _ 1 = e + ei + . . . + e m _i < me = m, • e 7 



From (5.9) it follows that the total measure of the bad set inside the 



square 31 is: 

mes U . . . U Sjj < «g A m • e 1 / 3 '" 

There are about (^r) 2 = Kq 2 such squares. 

We conclude that outside a bad set 2, mes [B] < Am ■ e 1//3m , we have 
|v(x) — i?| > e, which proves (5.1) with C ~ im and 6 = g^-. 

□ 



Remark 5.1. The exponent b in (5.8) is related to the Lojasiewicz exponent 



of the function v (see [T7], |10|). Determining the optimal exponent in 
such an inequality is an interesting problem in itself, and has been studied 
extensively for polynomials and analytic functions. It is clear that for a 
polynomial, the Lojasiewicz exponent should be related to its degree d, and 
it is in fact shown to be O(-p) with explicit underlying constants (see [TO] . 
|17|). The proof of the Lojasiewicz inequality for analytic functions in |12j 
(see Lemma 11.4 there) does not provide an explicit value for the exponent, 
but Theorem 4 in |21| provides a scheme for computing it via the Newton 
distance of v. 

In our proof for smooth, transversal functions, we obtain the exponent j^r, 
where m is the maximum number of partial derivatives needed for transver- 
sality. If v were a polynomial of degree d, then m would be d, which shows 
that our estimate is very wasteful (we have obtained a better estimate, 
0{— ), for one-variable functions, see Lemma 5.3 in [16J). This, however, 
seems to be the only such estimate available now for non-analytic functions 
of two variables. 

A similar argument can be made for functions of more than two variables, 



so (5.8) will hold for such functions as well. 



6. Large deviation theorem, the proof of main results 
Using induction on the scale N, we will prove the large deviation esti- 



mate (2.2) for the logarithmic average of transfer matrices: 

mes [xeT 2 : \^log\\M N (x, E)\\ - L N (E)\ > N~ T ] < e~ N ° 

as well as a lower bound on the mean of these quantities: 

L N (E) > 77V log |A| 



The base step of the induction uses the quantitative description (5.8) of 
the transversality condition (1.12) on the potential function, and the large 
size of the coupling constant. The inductive step uses only the regularity of 
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the potential function via Lemma 3.1 which provides a good approximation 
of these logarithmic averages by pluri-subharmonic functions. 

Lemma 6.1. (Base step of the induction) Assume that v(x) is smooth and 



satisfies the transversality condition (1.12). Then given any constant C > 



there are positive constants Ai and B which depend on v and C , such that 
for any scale No, for any X subject to |A| > max{Ai, Nq } and for any E £ E 
we have: 

mes [xeT 2 : | ~ log\\M No (x, A, E)\\ - L No (X, E)\ > ~ 5(A)' 

iVn 



20 



<N C 



Furthermore, for these X, Nq and for all E we have: 

L No (\,E)>±S(\) 

L No (X,E)-L 2No (X,E)<^S(X) 



.1) 



3.2) 



(6.3) 



Proof. The proof of this result is similar to the analytic potential func- 
tion case. That is because the only fact about analyticity needed here is 



the Lojasiewicz inequality (5.8), which holds for any non-constant analytic 



functions, and which we have established in section [5] for smooth functions 
satisfying the transversality condition (1.12). We will then omit the proof, 
but the reader is referred to the proof of Lemma 2.10 in [6] for details. □ 

We will now explain the idea of the proof of the inductive step. 



If at scale iVo we apply the almost invariance property (3.18) n times and 
then average, we get: 

"~ J nS(X) 

n 



1 "- 1 

\L No (x) - - ^Lat (T j x)| < 



j=0 



No 



(6.4) 



so using the approximation (|3.16|), we also get: 
\u No (x) 



1 



n 



n-l 

E 



u*„(T'x)|<^ 



.5) 



To have a decay above, we need to take a smaller number of shifts n -C A^o 



tions to un (x) and get: 



Apply the estimate (4.1) on averages of shifts of pluri-subharmonic func- 



mes \x G 



^ n— 1 

T 2 :|-V' 
n ^ 



u No (T j x) - (u Nq ) 



3=0 



S 

> n 

PN 



T ] 



< e 



(6.6) 



We may combine (6.5), (6.6) to directly obtain a large deviation estimate 



for ujy {x) and then, via the approximations (3.16), (3.17) to obtain the 

S , 

PN ' 



LDT for Ljv (x), only when the deviation Sm T " <C 1. In other words, this 



approach works only when the scaling factor is not too large to cancel the 
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decay n r °. This is the case of the single or multi- frequency shift model with 
analytic potential (see [2], [5]) where = ^ is just a constant depending on 
the potential function v. This approach also works for the szngZe-frequency 
model with potential function in a Gevrey class of order s < 2, since in 



this case sharper estimates than (6.6) are available for averages of shifts of 



single- variable subharmonic functions (see [IS]). This approach fails for the 
skew-shift model (whether the potential function is analytic or Gevrey) and 
also for the multi-frequency model with Gevrey potential function, because 
the size pn of the subharmonic extension depends on the scale Nq. 

Therefore, in order to beat the scaling factor J^- when applying the 



estimate (6.6) to a transfer matrix substitute un (x) at scale Nq, we need 
to consider a large number of shifts n ^> Nq. The averages of shifts thus 
obtained will be close to the mean (un )- Moreover, we will get: 

(i) (2) i £pi 

L No ~ (u No ) ~ -2^u No (T J x) 



n 

j=0 



(3) 



-E^ lo gH M ^(T^)|| ~ ^-log\\M nNo (x)\\ 

3=0 



The first approximation above is just (3.17). The second is exactly (6.6). 



The third is due to (3.16). The last approximation above essentially says 
that: 

n— 1 n— 1 

l[\\M No (Vx)\\ « \\HM No (Vx)\\ « \\M nNo (x)\\ 

3=0 3=0 
or in other words, that the product of the norms of certain transfer matrices 
is approximately equal to the norm of the product of these matrices, the 
latter giving us the transfer matrix at the larger scale uNq. 
If these heuristics were true, then for ji > JVo we would get 

l n ~ —=-log\\M nNo (x)\\ 
nl\Q 

which would establish the large deviation estimate for transfer matrices at 
a larger scale uNq. 

The avalanche principle, which is a deterministic result, describes how 
estimates on the norms of individual (and of products of two consecutive) 
matrices can lead to estimates on the norm of the product of all 
matrices (see [12], [3]), thus providing the basis for establishing the above 
heuristics. It requires a uniform lower bound on the norms of individual 
matrices in the product, as well as knowing that the norm of the product of 
any two consecutive matrices is comparable to the product of their norms. 

The following lemma provides the inductive step in proving the LDT for 
an increasing sequence of scales N. It also provides the inductive step in 
proving the positivity and continuity of the Lyapunov exponent. The proof 
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of this lemma is based on the heuristics described above, and combines the 



averages of shifts estimate (4.1), the almost invariance property (3.18) and 
the avalanche principle (see Proposition 2.2 in |12j). 

Before stating the lemma let us describe the various parameters and con- 
stants that will appear. 

List of constants and parameters: 

■ s > 1 is the order of the Gevrey class. 

■ 5 = 2(s — 1) refers to the size (~ N~ s ) of the holomorphic extensions of 
the transfer matrix substitutes. 

■ D := 25 + 8, A := max{^^, 2} are some well chosen powers of the 
scale N, tq is the exponent from (4.1). 

■ 7 > \ is a fixed number. 

Note that all these constants are either universal or depend on the order 
s of the Gevrey class. 

■ A, E are fixed parameters such that \E\ < |A|i? + 2, and B := sup|v(x)|. 

■ The transformation T = where uj € DC K or T = T w where cj £ DC K 
for some k > 0. 

■ ./Voo = Noo(s, k, B) is a sufficiently large integer, such that the asymptotic 
behavior of various powers and exponentials applies to iVoo and such that 

holds for N m shifts. 

Lemma 6.2. (The inductive step) Consider two scales Nq and N such that 
N$ > iVoo> (3.16) holds at scale Nq, that is: 

N > S(X) O |A| < (6.7) 



and 



Nq < N < e No 



Assume that a weak LDT holds at scales Nq and 2Nq: 
mes [xeT 2 : \^-log\\M No (x,\,E)\\-L No (\,E)\ > ^-S(X)} < N~ D (6.9) 

1\Q 1U 

mes [i£T 2 : |-L \og\\M 2No {x, A, E)\\ - L 2No (X, E)\ > ^S(X)} < N~ D 

, (6 - 10) 

and that the means L^ , L 2 n have a lower bound and are close to each 
other: 

L No (X,E), L 2No (X,E) > 75(A) (6.11) 
L No (X,E)-L 2No (X,E) < gS(X) (6.12) 

Then similar (but stronger) estimates hold at the larger scale N: 

mes [x e T 2 : \ — \og\\M N (x, X, E)\\ -L N {X,E)\ > S(X)N- T ] < e~ N ' 

(6.13) 
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L N (X,E) > 7 5(A) (6.14) 
-2[L No (X,E) - L 2No (X,E)} - C'oS^NoN- 1 
L N (X,E)-L 2N (X,E) < C S(X)N N^ 1 (6.15) 

for some positive absolute constants Co,t,o~. 

Proof. The parameters A, E and the transformation T = S w or T = T w 
are fixed, so they can be suppressed from notations. For instance Mjy(x) = 
M N (x,X,E), 5(A) = S etc. 

■ We can assume without loss of generality that N is a multiple of Nq, 
that is, that N = n ■ N . Indeed, if N = n ■ N + r, < r < JV , then 

|4log||Mjv(as)|| V log\\M n . No (x)\\\ < 2SAW- 1 (6.16) 

1 iV n ■ Nq 

Therefore, if we prove (6.14), ( |6.15 ), (6.13) at scale n- Nq, then they hold 
at scale N too. 

To prove (6.16), first note that Mjv(x) = B(x) ■ M n .N (x), where 



n-N +l n-N +l 

B{x) := \ \ A(T j x) = Yl A ( T 

j=N j=n-N +r 



'J, 



so 



\\B(x)\\ < e r ' s < e N(rS and ll^fe) -1 ]! < e r ' s < e N °- s 
Since ||M n .jv (^)|| > 1 and \\M N (x)\\ > 1, it follows that: 



ilog||M„fe)||-^ 



\og\\M n . No (x)\\ 



1 log \\M N (x)\\^ 



n-N 



M n . No (x) 



n ■ Nr\ n ■ Nr> 

< _J_i og &)ll^ • l|M„.^ (x)||^ 



//•An'" \\M n . No (x)\\ 

\og(e N ° s )^ = SN Q N- 



~ n-N( 



Similarly 
1 



1 



log||M n . J v fe)||--log||M iV (x)|K „ ~ B 
"•-No A* n-AT || MAr ^||^o 



1 ||M n .jv (z) 

log 



1 



< 



n- iV 
1 



log 



\M n . No (x )\\\ — 
\\M N (x}\ 



\M n . No {x)\\% 



\og[\\(B(x))- L \\^ .\\M n . No (x)\\N 



n-N 

< log [(e^ 5 ) 2 ^ • {e nNoS )^r] = 2SN N 



-i 



and inequality (6.16) now follows. 
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■ We are going to show that (6.8) - (6.12) allow us to apply the avalanche 
principle to the "blocks" Mjy (Tv^ N ° x), for j = 1, n. Each of these blocks 
is a product of No matrices, and they multiply up to Mn(x). 

Denote the set in (6.9) by Bjy and similarly the set in (6.10) by B 2 n q . 
If x ^ Bjy then using (6.9), (6.11) and (6.8) we get 

97 . 



||Afjv (z)|| > e -^SN +L NQ - N > eJ lSN =:fjL>e N > N>n 



so 



II M No (x) || > /i >n ifx(£B No 



For 1 < j < n 



(6.17) implies 



(6.17) 

£ consider Aj = Aj(x) := M No {T^-^ N °x). Then 

n 

(6-18) 



min \\Ai(x)\\ > u for all x 4 I J T~ jN °B Nn 

Kj<n J s —'« r W 

3=0 



Since A,-+i(V) • Aj(ac) = M 2No (T^~ 1 ) N °x), using ( |6j| ), ( |6.10[ ), ( |6.12[ ), for 
x $ \J] =0 {T- jN °B No ) U U™=o( T ~ ~ jN ° B 2N ) (which is a set of measure 
< 2N- D ■ N = 2N~ D+1 ), we have : 

log||A, + i(x)|| + log||A 3 -(x)|| - \og\\A j+1 (x) ■ Aj[x)\\ 

= log||M^(T^x)|| + logHM^^-^x)!! - ]og\\M mo (T^ N °x)\\ 

< N (L Nq + ^) + JVo(^o + fo 1 ) + 2Ar o(f^ - L 2No ) 

1 



Therefore, 



45t 95^ 
2iVo(Liv - L 2No ) + -^iV < -^-No = - log/, 



1 



log||A i+1 (x)|| +logp,(x)|| -\og\\A j+1 (x) -A 3 (x)\\ < 2 lo g/ u (6.19) 

for x outside a set of measure < 2N~ D+1 . 

Estimates (6.18), (6.19) are exactly the assumptions in the avalanche 
principle (Proposition 2.2 in |12j). We then conclude: 

n—l n—1 
|log||A n (x) • . . . • Ai(x)\\ + ^2\og\\Aj(x)\\ - J2 l °s\\A j+ x(x) ■ Ai(x)\\\ < 

3=2 3=1 



for x outside a set of measure < 2N D+1 . 

Hence, since N = n ■ No and A n (x) ■ . . . ■ A\{x) = Mn(x), we have: 

n-l 

|log||M*(aO|| + 2 lo gll M ^(T°'- 1)JVo x)|| 

3=2 

n-l 

-^og\\M 2No CT^ N "x) 

3=1 



n 
/' 

(6.20) 



< 



n 
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Therefore 



n-l 



\Uog\\M N (x)\\ + ^^r^g\\M No (T^- 1 ^ 

3=2 ° 



x 



n-l 



^-Llog||M 2iVo (Ty- 



1)^0, 



n^2N 

3=1 



< 



(6.21) 



■ We w ill go from averages of n blocks in (6.21), to averages of N shifts. 
In (6.21) replace x by x, Tx, . . . T^ 0-1 ;^ and then average (i.e. add up all 
these Nq inequalities and divide by iVo) to get: 

JVo-l N-l 
— ^ 1 °gl|M J v(T^)|| + -^— log\\M No (Vx)\\ 



u j=o 



3=0 U 



N-l 



AT E a^^ll^oC^III <^ (6-22 



The almost invariance property - Lemma (3.18) implies: 

No-l 



1 

N 



log||.A/A!£!|! - ^ £ ~log\\M N (Vx)\\\ < ^° 



iV f-; AT 

3=0 



(6.23) 



From (6.22) and (6.23) we get: 



N-l 



Uog\\M N (x)\\ + 1 ^ ^log||M % (T^)|| 
j=o 



iV-1 



X; ~log||M 2A r (T%)||| < ^° + I < 5iV iV-i 



iV 27V 



(6.24) 



-Z>+1 



for x^B i: = {j;=o( T ~ JB N )U\J] =0 (T-iB 2No ) where mes [BJ < 2JV" 
Integrating the left hand side of ( |6.24 | in x, we get: 

\L N + L No - 2L 2No I < CSNqN- 1 + AS ■ 2N-° +l < CoSNoN' 1 (6.25) 

Ln + Ln — 2L 2 n > —CqSNqN^ 1 

Ln > Ln — 2(Ln — L 2 n ) — CqSNqN^ 1 

> 7 5 - 2(L No - L 2No ) - CoSNqN- 1 



which proves (6.14). 



Clearly all the arguments above work for iV replaced by 2N, so we get 

(6.26) 



the analogue of (6.25) : 



L 2 n + Ln — 2L 2 n I < CqSNqN 1 



From (6.25) and (6.26) we obtain 

L N - L 2N < CqSNqN- 1 
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which is exactly (6.15). 



■ To prove the LDT (6.13) at scale N, we are going to apply the estimate 
(4.1) on averages of shifts of pluri-subharmonic functions to the transfer 
matrix substitutes un and U2N - Their widths of subharmonicity in each 
variable are pn q , P2N ~ -^o~ 5_1 an d they are uniformly bounded by S. 

Using ( 3.16 ) which holds at scales iVo and 2Nq due to (6.7), we can 'sub- 
stitute' in(|6~24) ^ \og\\M No (T^ {x)\\ by u No (T^x) and ^ log||M 2A r (T%)|| 
by U2N (^ 3 x) and get, for x B\\ 

1 1 N ~ 1 2 JV_1 

|-log||M iV (x)|| + - m (T j x)-- Yl ^iv (T%)| < SNoN- 1 (6.27) 

3=0 j=0 



Applying (4.1) to um and U2N we get : 
1 N ^ 

mes [x G T 2 : |- ]T un (T*x)-(u No )\ > S-N^-N'^] < 



-N a o 



(6.28) 



3=0 
N-1 



mes 



k e T2 : 1^ E W^) - (« 2iVo )| > S • 7V 5+1 • N~ T °] < 



3=0 



Denote the union of the two sets in (6.28), (6.29) by B2. 
Since N satisfies (6.8), 

S ■ N* +1 ■ N- T ° < S ■ (N 1/A ) 5+1 ■ N~ T ° < S ■ N~ T1 where n < 
so from ( |6.27D , fl6.28| ), @ we get: 

1 \0g\\M N (x)\\ + (u Nq ) - 2(U2N )\ 



(6.29) 



70 



iV 



< SNoN' 1 + 5 • A^^ T1 < S ■ N- Tl 
for x ^ B := BiL) B2, where 

mes [B] < 2N- D+1 + 2e~ N ' < W-° +1 < N-° +2 



(6.30) 



Using (3.17) at scales iVo, 2iVo and taking into account (6.8), estimate 
( 6.30| ) becomes: 

\^\og\\M N {x)\\ + L No - 2L 2No \ < 2S-N~ T > + 2e~ N o < 3SN~ T ^ (6.31) 
provided x £ B. 



Combine (6.31) with (6.25) to get: 
, 1 



N 



log||Mjv(x)|| - L N \ < CqSNqN' 1 + 3S ■ N~ Tl < S ■ N' 



(6.32) 



for all x £ B, where mes [B] < N~ D+2 and r 2 < t x . 



However, (6.32) is not exactly what we need in order to prove the es- 
timate (6.13). We have to prove an estimate like (6.32) for x outside an 
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exponentially small set, and we only have it outside a polynomially small 
set. To boost this estimate, we employ again Lemma |4.1| 



From (6.32), using again (3.16), (3.17) at scale N, we get: 
mes [iET 2 : \u n {x) - (u N ) \>S- iV~ T2 ] < N~ D+2 



(6.33) 



We apply Lemma 4.1 to u(x) := ^uj\f(x), which is a pluri-subharmonic 
function on the strip A , with upper bound B = 1 on this strip. 



Estimate (6.33) implies 



mes [x E T 2 : \u(x) - («) | > N~ T2 } < N-° +2 (6.34) 
Then for e := N~ T2 , e x := N- D+2 , B = 1, p = p N « N^- 1 we have 




N ~T2/i + N — N - 



D+2 



N -T2/4 + N -l N r 2 /2 < N - 



for some positive constant a±. 



The conclusion (4.6) of Lemma 4.1 then boosts (6.33) from a small devia- 



tion outside a polynomially small set, to one outside an exponentially small 
set, amid a small power loss in the deviation: 

mes [x E T 2 : \u N (x) - (u N )\ > S AT^/ 4 ] < e"^ 1 < e - N " (6.35) 

which proves estimate (6.13). □ 



Remark 6.1. The scaling factor " in estimate (|4.6|) of Lemma 



4.1 



is what 



prevents this approach via polynomial approximation to extend to more 
general Carleman classes of potential functions. This is because when the 
estimates on the Fourier coefficients of the potential function are weaker than 



estimate (1.11) for Gevrey functions, the size p = pn of the holomorphic 



extension of the A^th transfer matrix substitute will cancel any decay in the 

P eo 1 ^ 



expression 



We will combine the base step (Lemma 6.1) with the inductive step 
(Lemma 6.2) to prove the large deviation estimate for transfer matrices 
and the positivity of the Lyapunov exponent. The proof of the LDT will 
also provide us with the major ingredient for deriving the continuity of the 
Lyapunov exponent. 



Theorem 6.1. Consider the Schrddinger operator (1.1) on l 2 (Z): 
[H(x)rp] n - 



-i*n 



+1 



^n-i + Ai>(T n x) *lj n 



where the transformation T is either the skew-shift (1.4) or the multi-frequency 



shift (1.6). Assume that for some k > the underlying frequency satisfies 



the Diophantine condition DC K described in (1.8) or (1.9) respectively. 



Assume moreover that the potential function v(x) 
class G S (T 2 ) and that it is transversal as in (1.12). 



belongs to a Gevrey 
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Then there exists Ao = Xq(v,k) so that for every fixed A with |A| > Ao 
and for every energy E, we have: 

mes [x 6 T 2 : \ — log\\M N (x,X,E)\\ -L N (X,E)\ > N~ T ] < e~ N ' (6.36) 

for some absolute constants r, a > 0, and for all N > Nq(\, k, v, s). 

Furthermore, for every such transformation T and coupling constant A 
and for all energies E 6l we have: 

L(X,E) > ^ log| A| > (6.37) 



Proof. We refer to the list of constants preceding Lemma 6.2 



We use the initial step - Lemma 6.1 at a sufficiently large initial scale 
No > iVoo = Nqq(v). We will explain how the scale iVo is chosen later. We 
get constants Ai, B > such that for every A with |A| > max{Ai, (2No) B } 
(we want Lemma 6.1 to apply at both scales iVo and 2Nq) we have: 

mes [x£T 2 : \±-]og\\M No (x)\\ - L No \ > ^S] < N^' < N~ D (6.38) 

mes [xGT 2 : log ||M 2JVo (x)|| - L 2No \ > ^S] 

< (2N )- A2 - D < N-° (6.39) 



Ln , L 2 n 
L No - L 2No < ^ S 



Of course (6.38) and (6.39) hold provided N satisfies: 

tA 2 



N < N, 



o 



Estimates (6.38) - (6.41) above are exa ctly the assumptions (|6.9|) 
(at scale iVo, with 7 = 70 = \ 



in Lemma 



6.2 



(6.40) 
(6.41) 

(6.42) 



- fl6l2l ) 



of the inductive step of LDT. 



However, in order to apply this inductive step lemma and obtain similar 
estimates at the larger scale N, the initial scale iVo and the disorder A have 



to satisfy the condition (6.7). Together with the conditions on A and A^o 



from the initial step (Lemma 6.1), Nq and A have to satisfy: 



No 



(2iV ) < |A| < e 

N > N 00 
\X\ > Ai 



(6.43) 
(6.44) 
(6.45) 



We want to prove the LDT for every disorder A large enough, |A| > Ao 
and not just for A in a bounded interval as in (6.43). To do that, we will 
have to first choose A large enough, and then to pick iVo = -/Vo(A) > iVoo 
appropriately. Here is how we can accomplish that. 
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The condition (6.43) is equivalent to 

log |A| < A < l\M 1/B 



(6.46) 



We can find Ao large enough, Ao = \o(v,k), Ao > Ai, so that if |A| > Ao, 
then 

log |A| > A 00 and log |A| < J|A| 1/B (6.47) 



Then for every such A we can pick Ad = Ad (A) so that (6.46) holds. 
Combining this with ( |6.47[ ), we get that ( |6.43[ ), fl6.44[ ), ( |6.45[ ) hold. 

All the assumptions on the small scale Ao in the inductive step - Lemma 

(6.48) 



6.2 hold now, so if we choose the large scale A such that 

A A < A< A 42 (< e^ ) 



then (6.42) and (6.8) hold, so we can apply Lemma 6.2 to get: 

1 



mes [ief: |— log||M/v(x)|| - L N \ > SN~ T ] < e 



-N° 



(6.49) 

(6.50) 
(6.51) 



L N > loS - 2{L No - L 2No ) - CoSNqN- 1 
L N ~L 2N <C SN N- 1 

for some positive absolute constants Co,r, a. 

Estimate ( |6.49[ ) proves the LDT ( |6.36[ ) a t scal e A i n the r ange [N^, N^ 2 
If Ai is in this range, say Ai = Nq, then (6.51) and (6.50) imply: 

L Nl - L 2Nl < CqSNqN- 1 
L Nl >loS-3 CoSNoN- 1 = 7o S - 3 C N A+1 S =: 71 • S 

where 



7i := 7o - 3 C A ( 



-A+l 



1 



3 C A, 



-A+l 



> 



1 



provided we chose Ado (and so Ao) large enough depending on A, Cq. 
Therefore we have: 

L Nl > -yiS (6.52) 

and 

L Nl - L 2Nl < CqSNoN- 1 = C SN Q A+1 < -L • 5 < ^ • S 



so 



Lni ~ L 2Nl < ^ • S 



(6.53) 



Since 2Ai = 2N^ is in the range [N^,n£ }, ( |6.52| ) holds at scale 2Ai 
too, so we have: 

L Nl ,L 2Nl > 71 5 (6.54) 
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Choosing the next large scale N2 so that N A < N 2 < N^ 2 {< e Nl ), we 



have e~ N ° < N{ a2 ' d < N 2 D , so (|6.49| implies: 



mes \x 



G T 2 : |i-log||M Wl (x)|| -L Nl \> ±S\ < e' N ? < N 2 D (6.55) 



mes [x G T 2 



! 2iVi 



log||M 2iVl (x)|| - L 2Nl \ > -S] < N 2 D (6.56) 



Estimates (6.55), (6.56), (6.54), (6.53) are the assumptions in the in- 
ductive step - Lemma 6.2 with small scale N± and larg e scale N 2 , where 



N 2 G [Nf,Nf] = [N£',N£°]. Applying Lemma 
(6.36) for N 2 in this range. Moreover, we get: 



6.2 



we get the LDT 



Ln 2 ~ L2N2 < CqSNiN 2 1 



and 



L N2 > 71 5 - 2(L Nl - L 2Nl ) - CoSNtN^ 1 > ( 7l - 3C N{ A+l ) ■ S =: 72 • S 



where 



72 := 7i - 3C iV{ 



-A+l 



> 



1 



3C N Q A+1 



M-A+l) > 1 



2 " u u u 4 

again, provided A^o (thus Nq) was chosen large enough depending on A, Co. 
Hence we have L^ 2 > 72 • S and L^ 2 — L 2 n 2 < 40 ' 
Continuing this inductively, we obtain (6.36) at every scale N > N A . 
Also, at each step k in the induction process, if N G [N A ,N A ], then 
Ln > Ik ■ S > I ■ S 



so 



L = inf Ln > - ■ S 
N 4 



and (6.37) is proven. 



□ 



We now prove that the Lyapunov exponent is continuous as a function of 
the energy. 



Theorem 6.2. Under the same conditions as in Theorem 6.1 above, and 
for any |A| > \q{v, k), the Lyapunov exponent L(E) is a continuous function 
of the energy E with modulus of continuity on each compact interval £ at 
least: 

w(t) = C (log^)- /3 (6.57) 

where C = C(£, A, v, k, s) and (3 G (0, 1) is a universal constant that can be 
chosen, at the expense of C , to be arbitrarily close to 1. 

Proof. We will fix A, T and omit them from notations. We also fix the 
compact interval £. 

It is easy to show (see below) that for every scale N, the functions L^(E) 
are (Lipschitz) continuous. To prove that their limits L(E) are also continu- 
ous with a certain modulus of continuity, we need a quantitative description 
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of the convergence L^{E) — > L{E) as ./V — > oo. The better this rate of 
convergence, the sharper the modulus of continuity of L{E). 

It follows from the proof of Theorem 6.1 above (see ( 6.51| ) and the in- 
ductive process thereafter) that for every scales Nq and N such that Nq > 
N 00 (X, v, k) and N$ < N < Nf, we have: 

L N (E) - L 2N (E) < NqN- 1 < N l / A N- 1 =: N" P 



so 

L N (E)- L 2N {E) < N~e for all N > N 00 
Summing up over dyadic Af's we conclude: 

L N (E)- L(E) < N'P for all N > N 00 
which is the quantitative convergence we were seeking. 
To show that 

1 



(6.58) 
(6.59) 



L N (E) 



N 



T 2 



log\\M N (x, E)\\dx 



are continuous, we use Trotter's formula for the transfer matrix Mtv(x, E): 

M N (x,E) - M N (x,E') = 



N 



A(T N x, E)... [A{T j x, E) - A(T j x, E')] ...A(T x, E') 



But 



and 



so 



E'-E 




A(T^x,E) - A(T J x,E') = 

\\A(T j x,E)\\ < e s forall£e£ 



\\M N (x,E) - M N (x,E')\\ < e SN \E-E'\ 

Therefore, since \\M N (x,E)\\ > 1 and \\M N (x,E')\\ ^ h we nave: 

\log\\M N (x,E)\\-log\\M N (x,E')\\\ 

< \\M N {x,E) -M N (x,E')\\ <e SN \E-E'\ 

Integrating in x we obtain: 

\L N {E) -L N {E')\ < e SN \E-E'\ (6.60) 

which shows Lipschitz continuity for the maps L^(E). 
Combining (6.59) and (6.60) we obtain: 

\L{E) -L(E')\ <N- + e SN \E-E'\ for all N > N 00 (\,v, k) (6.61) 
For every such TV let 



\E-E'\ 



SN N -p 



so 



\L(E)-L(E')\<N-P 
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Since 



we have 



log- 



E-E> 



SN + plogN <SN 



N~ 



(log 



E — E' 



id' 



C (log 



E-E 



7t)" 



where C = C(X,v,k). 

We conclude, using the compactness of £, that for some constant C = 
C(£, A, v, k), and for a constant (3 that can be chosen arbitrarily close to 1 
by starting off with a large enough constant A, we have: 

\L(E)-L(E')\<C (log j^^y? 



□ 



Remark 6.2. The rate of convergence (6.59) can be improved to 

\L(E) + L N (E)-2L 2N (E)\<e- cNV for all A > A 00 (6.62) 

which follows from the proof of the inductive step, Lemma 6.2 (see estimate 
( 6.25| )) and uses the avalanche principle. This faster rate of convergence 
leads to the sharper modulus of continuity (1.14) (see [16], [6] for details). 



We will now explain how Anderson localization is derived from the large 
deviation theorem 16. 1[ 

Given the Schrodinger operator 

[H(x) il>] n := - Vn-i + A v(T n x) ^ n (6.63) 

for every scale we denote 

H N {x) := R [ljN] H(x)R[ ljN ] 

where R[i,n] is the coordinate restriction to [1, A] CZ with Dirichlet bound- 
ary conditions. 

Then the associated Green's functions are defined as 

G N (x,E) := [Hn^-E]- 1 

if the N x N matrix H~n(x) — E is invertible. 



The large deviation estimate (6.36) implies, via Cramer's rule, 'good 



bounds' on the Green's functions Gn(x,E) associated with (6.63) 
Indeed, for 1 < m < n 2 < A, we have: 

G N (x,E)(n 1 ,n 2 ) = [H N (x) - E]~ 1 (n ll n 2 ) 
det [fl" m _i(x) - E] ■ det [H N _ n2 (T n *x) - E] 



(6.64) 
(6.65) 



det [H N (x) - E] 

There is the following relation between transfer matrices and determi- 
nants: 

det [H N (x) -E] - det [iTjv-i(Tx) - E] 
det [H N -i(x) -E] - det [H N _ 2 (Tx) - E] 



M N (x, E) 



(6.66) 
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Therefore, we get the following estimate on the Green's functions: 



\G N (x,E)(n u n 2 )\ < 



\M n ,(x,E)\\-\\M N . 



"2 



r T n2 x,E) 



det (H N (x)-E)\ 



Combining this with the LDT (6.36), we obtain the following bounds on 



the Green's functions Ga(E,x) associated with the operator (6.63). 



For every N large enough and for every energy E, there is a set Qn(E) C 
T 2 with mes [fiTv(-E)] < e~ N " so that for any x ^ Qj^(E), one of the intervals 



A = A(s) = [1,N], [1, N - 1], [2,N], [2, N - 1] 



will satisfy : 



\G A (E,x)( ni ,n 2 )\ < e-clm-nal+tf 1 " 

2irii-x and | fi Q^| < Me"*l 1/S 
substituting in (6.67) v(x) by := X]|/|<C7V s v(l)e 2ml -'- 



Since = X^zez 2 v(l)e 



(6.67) 

for all 1 £ Z 2 , 
we can assume 



that the 'bad set' Un(E) above has not only exponentially small measure, 
but it also has bounded algebraic complexity - it is semi- algebraic of degree 



The rest of the proof of localization for (6.63) uses semi-algebraic set the- 



ory, and follows exactly the same pattern as the proof of the corresponding 
result for the analytic case (see [6] or Chapter 15 in [3]). 



Remark 6.3. We do not know if the transversality condition (1.12) is indeed 
necessary, either for the models considered here or for the single-variable 
shift considered in [16]. In particular, we do not know if the Lyapunov 
exponent is still positive throughout the spectrum for potential functions 
that have flat parts but are very smooth otherwise. This is a difficult and 
interesting problem. 

Finally, a more challenging problem regarding Gevrey potential functions 
is proving localization for a long range model, one where the Laplacian is 
replaced by a Toeplitz matrix. In the case of the skew-shift dynamics, this 
could lead to applications to more general quantum kicked rotator equations. 
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